AITopics | knowledgeable verbalizer

Collaborating Authors

knowledgeable verbalizer

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian

Auriemma, Serena, Miliani, Martina, Madeddu, Mauro, Bondielli, Alessandro, Passaro, Lucia, Lenci, Alessandro

arXiv.org Artificial IntelligenceJul-30-2024

Pre-trained LMs have had a significant impact on Natural Language Processing (NLP), with the "pre-train and fine-tune" paradigm rapidly becoming the predominant approach to apply effective models on a wide variety of downstream tasks [1-3, inter alia]. However, one of the main concerns when working with LMs is the paucity of annotated data, especially for specific domains or low-resource languages, required to fine-tune the additional classification layer on top of these models for downstream tasks, such as classification. Recently, prompt-based tuning has started to affirm as a promising way to perform similar tasks, significantly reducing the need for annotated data. This approach has been proven to be very effective with Large Language Models (LLMs) [4]. However, it is often the case that LLMs are not available for low-resource languages, and that their performance drastically decreases when they are challenged on specific domains. Moreover, in the Digital Transformation era, businesses frequently need to integrate artificial intelligence systems into their application ecosystems. This requires them to utilize specialized, publicly available models while also employing effective methods to leverage these models in scenarios where annotated language resources are unavailable, thereby operating in a zero-shot mode. Hence, we decided to evaluate two smaller domain-specific encoder models: BureauBERTo [5], a LM further pre-trained on Italian bureaucratic texts (i.e., administrative acts, banking and insurance documents), and Italian Legal BERT [6] (henceforth referred to as Ita-Legal-BERT), a LM adapted to the Italian legal domain, on various classification tasks on domain-specific data exploiting a prompt-based technique in a zero-shot scenario. Additionally, we compared the performance of both models with that of a generic Italian model, UmBERTo.

calibration, knowledgeable verbalizer, verbalizer, (17 more...)

arXiv.org Artificial Intelligence

2407.20654

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Industry: Law > Statutes (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

A Few-shot Approach to Resume Information Extraction via Prompts

Gan, Chengguang, Mori, Tatsunori

arXiv.org Artificial IntelligenceMay-19-2023

Prompt learning's fine-tune performance on text classification tasks has attracted the NLP community. This paper applies it to resume information extraction, improving existing methods for this task. We created manual templates and verbalizers tailored to resume texts and compared the performance of Masked Language Model (MLM) and Seq2Seq PLMs. Also, we enhanced the verbalizer design for Knowledgeable Prompt-tuning, contributing to prompt template design across NLP tasks. We present the Manual Knowledgeable Verbalizer (MKV), a rule for constructing verbalizers for specific applications. Our tests show that MKV rules yield more effective, robust templates and verbalizers than existing methods. Our MKV approach resolved sample imbalance, surpassing current automatic prompt methods. This study underscores the value of tailored prompt learning for resume extraction, stressing the importance of custom-designed templates and verbalizers.

data mining, natural language, template, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-031-35320-8_32

2209.0945

Country: Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.04)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.87)
Information Technology > Data Science > Data Mining > Text Mining (0.63)

Add feedback